Multistrategy Learning for Information Extraction
نویسنده
چکیده
Information extraction IE is the problem of lling out pre de ned structured sum maries from text documents We are in terested in performing IE in non traditional domains where much of the text is often ungrammatical such as electronic bulletin board posts and Web pages We suggest that the best approach is one that takes into ac count many di erent kinds of information and argue for the suitability of a multistrat egy approach We describe learners for IE drawn from three separate machine learning paradigms rote memorization term space text classi cation and relational rule induc tion By building regression models mapping from learner con dence to probability of cor rectness and combining probabilities appro priately it is possible to improve extraction accuracy over that achieved by any individ ual learner We describe three di erent mul tistrategy approaches Experiments on two IE domains a collection of electronic seminar announcements from a university computer science department and a set of newswire ar ticles describing corporate acquisitions from the Reuters collection demonstrate the e ec tiveness of all three approaches
منابع مشابه
Winner-Takes-All based Multi-Strategy Learning for Information Extraction
The proliferation of information on the Internet has enabled one find any information he/she is looking for. Nevertheless, almost all of these informations are designed for human readers and are not machine readable. Information extraction is a task that addresses the above problem by extracting a piece of information from unstructured formats. This paper proposes a winner-takes-all based multi...
متن کاملReening Symbolic Knowledge Using Neural Networks
To use artiicial neural networks as part of a multistrategy learning system, there must be a way for neural networks to accept information, often expressed symbolically, from other systems. Moreover, neural networks must be able to transfer the results of their learning. A three-step process for learning using rule-based domain knowledge in combination with neural networks can address this task...
متن کاملAn Inference-based Framework for Multistrategy Learning
This chapter describes a general framework for multistrategy learning. One idea of this framework is to view learning as an inference process and to integrate the elementary inferences that are employed by the single-strategy learning methods. Another idea is to base learning on building and generalizing a special type of explanation structure called plausible justification tree which is compos...
متن کاملMacro and micro perspectives of multistrategy learning
Machine learning techniques are perceived to have a great potential as means for the acquisition of knowledge; nevertheless, their use in complex engineering domains is still rare. Most machine learning techniques have been studied in the context of knowledge acquisition for well defined tasks, such as classification. Learning for these tasks can be handled by relatively simple algorithms. Comp...
متن کاملIn Machine Learning: a Multistrategy Approach, Vol. Iv Macro and Micro Perspectives of Multistrategy Learning
Machine learning techniques are perceived to have a great potential as means for the acquisition of knowledge; nevertheless, their use in complex engineering domains is still rare. Most machine learning techniques have been studied in the context of knowledge acquisition for well deened tasks, such as classiication. Learning for these tasks can be handled by relatively simple algorithms. Comple...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998